Frame rate and viseme analysis for multimedia applications

Authors

  • Jay J. Williams
  • Janet C. Rutledge
  • Dean C. Garstecki
  • Aggelos K. Katsaggelos
Abstract

In the future, multimedia technology will be able to provide video frame rates equal to or better than 30 frames per second (FPS). Until that time, the hearing-impaired community will be using band-limited communication systems over unshielded twisted-pair copper wiring. As a result, multimedia communication systems will use a coder/decoder (CODEC) to compress the video and audio signals for transmission. For these systems to be usable by the hearing-impaired community, the algorithms within the CODEC have to be designed to account for the perceptual boundaries of the hearing impaired. In this paper we investigate the perceptual boundaries of speechreading and multimedia technology, which are the constraints that affect speechreading performance. We analyze and draw conclusions on the relationship between viseme groupings, accuracy of viseme recognition, and presentation rate. These results are critical in the design of multimedia systems for the hearing impaired.

Similar resources

Based Persian Viseme Clustering

Viseme (Visual Phoneme) clusterin every language is among the most important conducting various multimedia researches as reading, lip synchronization and com pronunciation training applications. With re that clustering and analyzing visemes are lan processes, we concentrated our research on P which indeed has suffered from lack of su paper, we used a hierarchical approach for c in Persian langu...

Persian Viseme Classification Using Interlaced Derivative Patterns and Support Vector Machine

Viseme (Visual Phoneme) classification and analysis in every language are among the most important preliminaries for conducting various multimedia researches such as talking head, lip reading, lip synchronization, and computer assisted pronunciation training applications. With respect to the fact that analyzing visemes is a language dependent process, we concentrated our research on Persian lan...

Variability Analysis for Multi-programmed Multimedia Applications CS 497 JT - Spring 2001

Multimedia applications are expected to form a large part of the workload on a growing number of systems, including future handheld computers, wireless telephones, laptop computers, and desktop systems [8, 9, 16, 17]. General-purpose processors (vs. specialized DSP processors or ASICs) are expected to be increasingly employed for such workloads [11, 8, 9]. Our previous work analyzed the variabi...

Toward Clustering Persian Vowel Viseme: A New Clustering Approach based on HMM

This paper sorts out the problem of Persian Vowel viseme clustering. Clustering audio-visual data has been discussed for a decade or so. However, it is an open problem due to shortcoming of appropriate data and its dependency to target language. Here, we propose a speaker-independent and robust method for Persian viseme class identification as our main contribution. The overall process of the p...

A Fuzzy Based Approach for Rate Control in Wireless Multimedia Sensor Networks

Wireless Multimedia Sensor Networks (WMSNs) undergo congestion when a link (or a node) becomes overpopulated in terms of incoming packets. In WMSNs this happens especially in upstream nodes where all incoming packets meet and directed to the sink node. Congestion in networks, if not handled properly, might lead to congestion collapse which deteriorates the quality of service (QoS). Therefore, i...

Publication date: 1997